ATGGCTCTGGGGCCCAACTGTGGCATCCTACTGTTTCTGGCTGTTTCTGGGTGTGGCCATCCCC 
AGGTTTCAAACTCGGGAAGTCGAATCGTGGGAGGGCATGCTGCCCCAGCAGGCACATGGCCGTG 
GCAGGCTAGCCTCCGTCTGCACAAGGTGCACGTGTGTGGAGGCTCCCTGCTCAGTCCAGAATGG 
GTGCTCACAGCAGCCCACTGCTTCTCTGGGTCTGTGAACTCGTCTGATTATCAGGTGCACTTGG 
GAGAGCTTACGGTCACACTGTCTCCCCACTTCTCCACTGTAAAACGGATCATCATGTACACTGG 
CTCTCCAGGACCACCGGGGTCCAGTGGGGACATTGCCCTGGTGCAGCTGTCCTCCCCGGTGGCC 
CTTTCCAGCCAGGTCCAGCCTGTGTGCCTCCCAGAGGCCTCAGCTGACTTCTACCCTGGGATGC 
AGTGCTGGGTGACTGGCTGGGGCTATACAGGGGAGGGAGAGCCTCTGAAGCCCCCATACAACCT 
TCAGGAGGCCAAAGTCTCTGTGGTGGATGTAAAGACCTGCAGCCAGGCTTACAATAGTCCCAAT 
GGCAGCCTCATCCAGCCAGACATGCTATGCGCCCGGGGCCCTGGGGATGCCTGCCAGGATGACT 
CTGGAGGGCCACTAGTCTGCCAGGTGGCTGGAACCTGGCAGCAGGCCGGCGTTGTCAGCTGGGG 
TGAGGGCTGTGGCCGCCCTGACCGCCCTGGCGTCTATGCCCGGGTTACTGCCTATGTAAACTGG 
ATCCACCACCACATCCCGGAAGCAGGGGGCTCAGGAATGCAAGGGCTTCCCTGGGCTCCTCTCC 
TGGCTGCCCTCTTCTGGCCAAGCCTCTTCCTGCTGCTGGTCTCTGGAGTCCTGATGGCCAAGTA 
CTGGCTGAGCTCTCCCTCCCACGCGGCCTCGGAACTCTGAATGAGGTGTAGCAACCAACCCAAG 
TGTCTTTCTTAAATAAGTTAGTGTTTATTCAGTTTGCTTTGCCCCTCCCCTCCCCTTAGCTTTG 
ACTTAGGAAGCCAAAGTTTTCTGCATCAGATTATTGCAACATTTAACCTGAATTTGTAGAACGG 
ATGACATAAAGCAAATGGATGTCAAAAAAAAAAA (SEQ ID N0:1) 

Q 
SI 

3 MALGPNCGILLFLAVSGCGHPQVSNSGSRIVGGHAAPAGTWPWQ 

0 

Si ASLRLHKVHVCGGSLLSPEWVLTAAHCFSGSVNSSDYQVHLGELTVTLSPHFSTVKRI 

p 

SP IMYTGSPGPPGSSGDIALVQLSSPVALSSQVQPVCLPEASADFYPGMQCWVTGWGYTG 
H EGEPLKPPYNLQEAKVSWDVKTCSQAYNSPNGSLIQPDMLCARGPGDACQDDSGGPL 
VCQVAGTWQQAGWSWGEGCGRPDRPGVYARVTAYVNWIHHHI PEAGG SGMQGL PWAP 
LLAALFWPSLFLLLVSGVLMAKYWLSSPSHAASEL (SEQ ID NO : 2 ) 



FIGURE 1 



Construct 



Gene: 372 Gl Number(s): 6103630 

Gene Family: Protease 

*? e P? .1 . Serine Protease 

Subfamily: 

Gene Sequence: full-length cDNA, Mouse 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



ATGGCTCTGGGGCCCAACTGTGGCATCCTACTGTTTCTGGCTGTTTCTG [ GGTGTGGCC A 
TCCCCAGGTTTCAAACTCGGGAAGTCGAATCGTGGGAGGGCATGCTGCCCCAGCAGGCAC 
ATGGCCGTGGCAGGCTAGCCTCCGTCTGCACAAGGTGCACGTGT] GTGGAGGCTCCCTGC 
TC AGTC C AGAATGGGTGC TC AC AGC AGC C C AC TGC TTC TC TGGGTC TGTGAAC TC GTCTG 
ATTATCAGGTGCACTTGGGAGAGCTTACGGTCACACTGTCTCCCCACTT [ CTCCACTGTA 
AAAC GGATC ATC ATGTAC AC TGGC TC TC C AGGACC AC C GGGGTC C AGTGGGGAC ATTGC C 
CTGGTGCAGCTGTCCTCCCCGGTGGCCCTTTCCAGCCAGGTCCAGCCTGTGTGCCTCCCA 
GAGGCCTCAGCTGACTTCTACCCTGGGATGCAGTGCTGGGTGACTGGCTGGGGCTATACA 
GGGGAGGGAG] AGCCTCTGAAGCCCCCATACAACCTTCAGGAGGCCAAAGTCTCTGTGGT 
GGATGTAAAGACCTGCAGCCAGGCTTACAATAGTCCCAATGGCAGCCTCATCCAGCCAGA 
C ATGC T ATGC GC C CGGGGC C C TGGGGATGC C TGC C AGGATGAC TC TGGAGGGC C AC TAGT 
C TGC CAGGTGGCTGGAACC TGGC AGC AGGCCGGCGTTGTC AGC TGGGGTGAGGGCTGTGG 
CCGCCCTGACCGCCCTGGCGTCTATGCCCGGGTTACTGCCTATGTAAACTGGATCCACCA 
CCACATCCCGGAAGCAGGGGGCTCAGGAATGCAAGGGCTTCCCTGGGCTCCTCTCCTGGC 
TGCCCTCTTCTGGCCAAGCCTCTTCCTGCTGCTGGTCTCTGGAGTCCTGATGGCCAAGTA 
CTGGCTGAGCTCTCCCTCCCACGCGGCCTCGGAACTCTGAATGAGGTGTAGCAACCAACC 
CAAGTGTCTTTCTTAAATAAGTTAGTGTTTATTCAGTTTGCTTTGCCCCTCCCCTCCCCT 
TAGCTTTGACTT AGGAAGC C AAAGTTTTC TGC ATC AGATT ATTGC AACATTTAAC C TG AA 
TTTGTAGAACGGATGACATAAAGCAAATGGATGTCAAAAAAAAAAA (SEQ ID NO:l) 



Gene Sequence Structure 



164 bp 



Sequence Deleted 287 bp 



Size of full-length 
cDNA: 1122 bp 




FIGURE 2A 



Targeting Vector* (genomic sequence) 
Construct Number: 1607 



Arm Length: 
5': 6 kb 
3': 0.7 kb 



— — — -/targeting "Vector: 

~ ~ ~ - " Endogenous Locus . 

©Not drawn to scale 



SI 



.5 'arm 



LacZ-Neo 
Cassette 



3* arm 




5 1 > GGAGTC ATGGAGGGC TC C C AG 
AG AAAGGGC ATTG AGC AG AATGC C 
GGTC TC C AGATTC C C TC ACC AAC A 
GTGTCTC C TC TGGATC AGGGTGTG 
GCC ATCC C C AGGTTTC AAAC TC GG 
GAAGTCGAATCGTGGGAGGGCATG 
CTGCCCCAGCAGGCACATGGCCGT 
GGCAGGCTAGCCTCCGTCTGCACA 
AGGTGACGTGT<3 1 
(SEQ ID NO: 3) 



5 • >CTCCACTGTAAAACGGATCAT 
C ATGTAC AC TGGCTC TC C AGGAC C 
ACCGGGGTCCAGTGGGGACATTGC 
CCTGGTGCAGCTGTCCTCCCCGGT 
GGCCCTTTCCAGCCAGGTCCAGCC j 
TGTGTGC C TC C C AGAGGC C TC AGC 
TGACTTCTACCCTGGGATGCAGTG 
CTGGGTGACTGGCTGGGGCTATAC 
AGGGGAGGGAG< 3 1 
(SEQ ID NO: 4) 



si 

s 



FIGURE 2B 



